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Abstract 



We consider conditional exact tests of factor effects in designed experiments 
for discrete response variables. Similarly to the analysis of contingency tables, 
Markov chain Monte Carlo methods can be used for performing exact tests, espe- 
cially when large-sample approximations of the null distributions are poor and the 
enumeration of the conditional sample space is infeasible. To construct a connected 
Markov chain over the appropriate sample space, a common approach is to compute 
a Markov basis. Theoretically, a Markov basis can be characterized as a generator 
of a well-specified toric ideal in a polynomial ring and is computed by computa- 
QQ . tional algebraic softwares. However, the computation of a Markov basis sometimes 

I becomes infeasible even for problems of moderate sizes. In this paper, we obtain 

' the closed form expression of minimal Markov bases for the main effect models of 

_ • . 2^~^ fractional factorial designs of resolution p. 

O 

cn 

1 Introduction 

^ ' In the past decade, various new applications of computational algebraic techniques to 
^ ■ statistical problems have been developed. One of the first work in this field, called com- 

■ ■ ■ putational algebraic statistics, is by Diaconis and Sturmfels ([5]). In this work, they 

introduced the notion of a Markov basis and presented a procedure for sampling from 
discrete conditional distributions by constructing a connected, aperiodic and reversible 
Markov chain on a given sample space. Since then, many works have been published on 
the topic of Markov basis by both algebraists and statisticians. Intensive results on the 
structure of Markov bases for various statistical models are given in [2] . 

This paper is based on the two works in this field, [1] and [3], which considered the 
Markov basis in the context of designed experiments. In these works, Markov chain Monte 
Carlo methods for testing factor effects are discussed, when observations are discrete and 
are given in the two-level or three-level regular fractional factorial designs. As one of the 
contributions of these works, the relation between the statistical models for the regular 
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fractional factorial designs and contingency tables is considered through Markov bases. 
As a consequence, to investigate the Markov bases arising in the problems of designed 
experiments, we can refer to the known results on the corresponding models for the 
contingency tables. For example, we see that the Markov basis for the main effect models 
of the regular 2jj^^ design given by the defining relation ABD = ACE = I is constructed 
only by the square- free degree 2 elements (l4j). This is because the corresponding model 
in the contingency tables is the conditional independence model in the 2x2x2 table. 
Note that the conditional independence model in the three-way contingency table is an 
example of decomposable models and we know the fact that a minimal Markov basis for 
this class of models can be constructed only by square-free degree 2 elements. See ^ for 
detail. On the other hand, the structure of the Markov bases for the models that does 
not correspond to the decomposable models is not known in general. 

In this paper, following the Markov chain Monte Carlo approach in the designed 
experiments by [1] and [3], we give a new result on the structure of the minimal Markov 
basis for the main effect models of 2^"^ fractional factorial designs of resolution p for p 
two-level factors. We show the minimal Markov basis for this case can be constructed by 
square-free degree 2 elements. 

The construction of this paper is as follows. In Section [2] we review the Markov chain 
Monte Carlo approach for testing the fitting of the log-linear models when observations 
are discrete random variables. The main results of this paper are given in Section 121 We 
show the minimal Markov basis for the main effect models of 2^"^ fractional factorial 
designs of resolution p for p two-level factors can be constructed by square-free degree 2 
elements. 

2 Markov chain Monte Carlo method for regular two- 
level fractional factorial designs 

In this section we introduce Markov chain Monte Carlo methods for testing the fitting of 
the log-linear models for regular two-level fractional factorial designs with count obser- 
vations. We consider designs of p factors with two-level. We write the observations as 
y = {yi, . . . , Uk)', where k is the run size and ' denotes the transpose. We suppose that 
the observations are counts of some events. We also suppose that only one observation is 
obtained for each run for simplicity. This assumption is natural for the setting of Poisson 
sampling scheme, since the set of the totals for each run is the sufficient statistics for the 
parameters. Write the k x p design matrix D = [dij), where dij G { — 1,1} is the level of 
the j-th. factor in the z-th run for i = 1, . . . , k, j = 1, . . . ,p. For example of 2fy^ fractional 
factorial design, the design matrix D is given as 



D = 
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For the count observations, it is natural to consider the Poisson distribution as the sam- 
pling model, in the framework of generalized linear models ([H])- The observations y 
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are realizations from k Poisson random variables Yi, . . . , Y^, which are mutually inde- 
pendently distributed with the mean parameter /Xj = E(Yi),i = l,...,k. We call the 
log-linear model written by 



logyUi = /3o + Pidii H h Ppd, 



l,...,k 



'I) 



as the main effect model in this paper. The equivalent model in the matrix form is 

/ log Ail \ 



where {3 = {(3q, . . . , and 



M 



/ 1 



D 



(2) 



We call the k x [p + 1) matrix M a model matrix of the main effect model. The inter- 
pretation of the parameter /3j in ([1]) is the parameter contrast for the main effect of the 
j-th factor. In models including various interaction effects are also considered. In this 
paper, we focus on the main effect model. 

To judge the fitting of the main effect model ([T]), we can perform various goodness- 
of-fit tests. In the goodness-of-fit tests, the main effect model ([1]) is treated as the null 
model, whereas the saturated model is treated as the alternative model. Under the null 
model ([1]), /3 is the nuisance parameter and the sufficient statistic for (3 is given by M'y = 
iYli=iyi^Yli=idiiyi, ■ ■ ■ ,Yli=idipyiy- Then the conditional distribution of y given the 
sufficient statistics is written as 

k 



/(y I M'y = MY 



1 



C(M'y°; 



(3) 



where y° is the observation count vector and C(M'y°) is the normalizing constant deter- 
mined from M'y° written as 



C{M'y' 




(4) 



and 



J^{M'y°) = {y I M'y = M'y", Vi is a nonnegative integer for i = 1, . . . , k}. (5) 



Note that by sufficiency the conditional distribution does not depend on the values of the 
nuisance parameters. 

We consider various goodness-of-fit tests based on the conditional distribution (|3]). 
There are several ways to choose the test statistics. For example, the likelihood ratio 
statistic 

k 



Tiy) = G'iy) = 2J2y^^og 



i=l 



(6) 
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is frequently used, where fii is the maximum hkehhood estimate for /ij under the null 
model (i.e., fitted value). Note that the traditional asymptotic test evaluates the upper 
probability for the observed value T{y°) based on the asymptotic distribution xl-p-i- 
However, since the fitting of the asymptotic approximation may be sometimes poor, we 
consider Markov chain Monte Carlo methods to evaluate the p values. Using the condi- 
tional distribution the exact p value is written as 

p= f(y I ^'^'y = MYmny) > T(y°)), (?) 

y6J-(A/'y°) 

where 

1, ifr(y)>r(y° 



^ ^^yy )) | g, otherwise. 



Of course, if we can calculate the exact p value of (171) and (IHl), it is best. Unfortunately, 
however, an enumeration of all the elements in J^{M'y°) and hence the calculation of 
the normalizing constant C{M'y°) is usually computationally infeasible for large sample 
space. In such situations, various Monte Cairo methods are effective. In particular, we 
consider a Markov chain Monte Carlo method in this paper. Note that, as one of the 
important advantages of Markov chain Monte Carlo method, we need not calculate the 
normalizing constant (j4]) to evaluate p values. 

To perform the Markov chain Monte Carlo procedure, we have to construct a con- 
nected, aperiodic and reversible Markov chain over the conditional sample space ([5]) with 
the stationary distribution ([3]). If such a chain is constructed, we can sample from the 
chain as y^^\ . . . , y*-"^^ after discarding some initial burn-in steps, and evaluate p values as 



p = i5^1(T(yW)>T(y")). 



t=i 



Such a chain can be constructed easily by Markov basis. Once a Markov basis is calculated, 
we can construct a connected, aperiodic and reversible Markov chain over the space (Q, 
which can be modified so that the stationary distribution is the conditional distribution 
([3]) by the Metropohs-Hastings procedure. See [5] and [7] for details. We give a formal 
definition of the Markov basis in the next section. 

Markov basis is characterized algebraically as follows. Write indeterminates Ui, . . . ,Uk 
and consider polynomial ring K[ui, . . . ,Uk] for some field K. Consider the integer kernel 
of the transpose of the model matrix M, Ker^M'. For each h = [bi, . . . , bk)' G Ker^M', 
define binomial in K[ui, . . . , uA as 



/b = n ^? - n 

6,>0 bi<0 



Then the binomial ideal in K[ui, . . . , Mfc], 

I{M') = ({/b I b G Ker^^M'}) , 

is called a toric ideal with the configuration M' . Let {/b(i)5 ■ ■ ■ ■, fw)} be any generating 
set of I{M'). Then the set of integer vectors {h^^\ . . . , b^'')} constitutes a Markov basis. 
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See [5] for detail. To compute a Markov basis for given configuration M', we can rely 
on various algebraic softwares such as 4ti2 ([!]). However, the computation of a Markov 
basis sometimes becomes infeasible even for problems of moderate sizes. In this paper, 
we give the closed form expression of minimal Markov bases for the main effect models of 
2^~^ fractional factorial designs. 

3 Minimal Markov basis for the main effect models 
of 2^^^ fractional factorial designs of resolution p 

3.1 Markov basis 

As we have seen in the previous section, if we can construct a connected Markov chain over 
J-'(M'y°) for the observation y°, we can estimate the conditional p value and judge the 
fitting of the main effect model. One of the common approach to construct such a chain is 
to calculate a Markov basis for M' in advance. Here we give a definition of Markov basis 
and necessarily notation on Markov basis. For detail, see [2], for example. We denote the 
set of integers and nonnegative integers by Z = {0, ±1, ±2, . . . , } and N = {0, 1, 2, . . . , }, 
respectively. We also denote the set of n-dimensional vectors of elements from Z and N 
by Z" and N", respectively. 

We call the set J^{M'y°) of ([5]) a fiber, z e Z'^ is called a move for M' if M'z = 0. If 
M'x = t and z is a move for M' , we have M'(x + z) = Af x = t. Therefore by adding z to 
X, we remain in the same fiber as long as x + z does not contain a negative element. This 
is the reason that z is called a move. Suppose we are given a finite set of moves B. For 
each y G N'^, we consider an undirected graph G{y°,B) whose vertices are the elements 
of a fiber J-'(M'y°). We draw an undirected edge between x and y in J^{M'y°) if there 
exists z G such that y = x + zory = x — z. Markov basis is defined as follows. 

Definition 1. A finite set B of moves for M' is called a Markov basis for M' is G{y", B) 
is connected for every y° G N'^. 

Suppose i3 is a Markov basis for M' . Then for any x, y{^ x) G J^{M'y°) for any 
y° G N'^, there exists a sequence of moves zi, . . . ,zl from B and ej = ±1, j = 1, . . . , L, 
such that y = X + J2j=i ^j'^j 



i.e., we can move from x to y by moves from B without causing negative elements on the 
way. In this case, we say that y is accessible from x by i3 and denote this by x ~ y (mod B). 
Obviously the notion of accessibility is symmetric and transitive. Allowing moves to be 
also yields refiexivity. Therefore accessibility by B is an equivalence relation and each 
fiber J-'{M'y°) is partitioned into disjoint equivalence classes by moves of B. We call these 



h 




h 



1,... 



L-1 



5 



equivalence classes S-equivalence classes of J-'(M'y°). Since the notion of accessibility is 
symmetric, we also say that x and y are mutually accessible by i3 if x ~ y (mod B). 

A Markov basis B is minimal if no proper subset of i3 is a Markov basis. A minimal 
Markov basis always exists, because from any Markov basis, we can remove redundant 
elements one by one, until none of the remaining elements can be removed any further. 
As for the structure of the minimal Markov basis, see Chapter 5 of P]. 



3.2 Theorem and proof 

Now we give the main result of this paper. We consider the main effect model of 2'^~^ 
fractional factorial design of resolution p for p two-level factors. Therefore the run size is 
k = 2P-^ and the 
p = 4 is given by 



(9) 



model matrix M 


is A; X 


b + 1) = 


2P- 


^ X (p + 1) 




( +1 +1 


+1 


+ 1 +1 


+1 


+1 +1 \ 




+ 1 +1 


+1 


+ 1 -1 


-1 


-1 -1 


M = 


+ 1 +1 


-1 


-1 +1 


+1 


-1 -1 




+ 1 -1 


+1 


-1 +1 


-1 


+1 -1 




\+l -1 


-1 


+1 -1 


+1 


+1 -I I 



A move z = (2:1, . . . , 2^)' for M' is called a square- free degree 2 move if 



+1, 
-1, 
0, 



for i = ii,i2, 
for i = 13, ii, 
otherwise 



for some distinct ii, «2, ^3, "^4 ^ {1, • • • , Our main theorem is as follows. 

Theorem 1. A minimal Markov basis for M' of the main effect model of 2^"^ fractional 
factorial design of resolution p is constructed as the set of square-free degree 2 moves. 

To show this theorem, we prepare necessary notation. We treat the frequency vector 
X G N'^ as the {p — l)-dimensional contingency table, i.e., 2^~^ table. Write the levels of 
each axis as {0,1} and let 

X = {i} = {(21 ■ ■ • = {0, 1} X ■ ■ ■ X {0, 1} 

denote the set of the cells. The frequency vector x G is then written as 

X = {x(i)}iei = {x{ii . . . im G {0, 1}, z = 1, . . . , m). 

Since x has many zero entries, it is useful to express x with total tj, as x = ii ■ ■ ■ i„, where 
ii, . . . , i„ are the cells of positive frequencies of x. In the case x(i) > 1, i is repeated x(i) 
times. Similarly, we express a move z as z = ii ■ ■ ■ in — ji ■ ■ ■ jn, where ii, ■ ■ ■ , are the 
cells of positive elements of z and ji, ■ ■ ■ , jn are the cells of negative elements of z. We 
also write the positive part and the negative part of z = ii ■ ■ ■ i„ — ji ■ ■ ■ j„ as z+ = ii ■ ■ ■ i„ 
and z~ = ji ■ ■ ■ j„, respectively. Let X = Xq U Xi and Xq fl Xi = 0, where 



Xn 



[ii ■ ■ -ip-i) 



Zm = (mod 2) 



m=l 
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m=l 

We define the one-dimensional sum of x, as 



«m = 1 (mod 2) 



^lv)^m — l)^m+lv"5^p — 1 

We also define the diagonal sum of x as 

XDiO) = ^x{i), XD{l) = ^x{i). 

Then the sufficient statistics for the observation x is written as 

M'x = (xi(0), xi(l), . . . , a;p_i(0), xb(0), xz)(l))'. (10) 

We also use this marginal notation for moves. Therefore for a move z, we have Zm{im) = 
for m = 1, . . . ,p — 1, im = 0,1 and Z£,{im) = for = 0, 1. We write levels {im^m} 
represent that {imii*m) is (0,1) or (1,0). 

First we show an important structure of moves for M' for the case of p = 4. 

Lemma 1. Any z = {2;(ii22^3)} ^ with either of the following types cannot he a move 
for M' given by 

• z{iii2iz), z{i\i2iz), 2;(«t^2^3), z{ili2il) > for some {iii2i'i)- 

• z{iii2iz), z{iii*2iz), ziili^i^i) , z{iiilil) > for some (iii2^3)- 

• z{iii2i3), ^(^l^2^3), z{ili2il), ziiiilil) > for some {iii2iz)- 

Proof. Straightforward. □ 



Now we show Theorem [H In the following proof, we use the method of the distance 
reduction. (See Chapter 6 of [2], for example). 

Proof of Theorem [T], Write the set of square-free degree 2 moves for M' as B* . To 

show that B* is a Markov basis for M', suppose B* is not a Markov basis. Then for 
some y° e N'^, J^(M'y°) consists of more than one i3* -equivalence classes. Let J^i and J-2 
denote two different jB*-equivalence classes. Choose x G J-*! and y G J-2 such that 



is minimized. Because x and y are chosen from different B* classes, this minimum has to 
be positive. We show a contradiction by induction on p. 
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Consider the case of p = 4. Suppose 2;(000) > without loss of generahty. Here we 
consider z(lll). If 2^(111) > 0, then ?/(000), > follows. Since z is a move, z also 

has negative entries, x{iii2i^) < for some (2122^3)- Then we can subtract the move 

zi = (000)(111) - {t,i2i3){tli*2tl) G B* 

from y without causing negative entries. However, we have |(y — zi) — x| < |y — x| since 
Zi and X has a common support, which contradicts the definition of x,y. Then we have 
z{lll) < 0. Next we suppose 2;(001) < 0. In this case, by the symmetry (i.e., same reason 
to the above), we can derive a contradiction if z{110) < 0. Then we have 2;(110) > 0. 
Next consider 2;(010). 

• Suppose 2;(010) < 0. We have 2;(101) > by symmetry. From ^^(0) = 0, we 
have 2(011) < 0. From 2d(1) = 0, we have z(lOO) > 0. From zi{l) = 0, we have 
;z(lll) < 0. However, ^(001), z(OlO), z(Oll), z(lll) < contradicts that -z is a 
move for M' by Lemma [H 

• Suppose 2(010) > 0. From 23(0) = 0, we have 2(100) < 0. By symmetry, we 
have 2(011) > 0. From 22(1) = 0, we have 2(111) < 0. From z^iO) = 0, we have 
2(101) < 0. However, 2(001), 2(100), 2(101), 2(111) < contradicts that -z is a 
move for M' by Lemma [H 

Then we have shown for p = 4. 

Next suppose p > 5 and the theorem holds for the cases up to p — 1. We begin with 
the case that some of M' y° is zero. Without loss of generality, we have two cases. 

• In the case of y°_i{l) = 0, define the set 

{y I y{H . . . ip^2) = x{h ■ ■ ■ ?p_20) for x G -F(M'y°) } • (11) 

Then we have 

ymi^rn) ■^mij-m) ^ 1; • • • i J* 2, 0, 1, 

yoiim) = Xoiira), V = 0, 1. 

Therefore from the definition of the fiber, the set ffTTl) coincides the fiber for 

(xi(0),a;i(l), . . . ,Xp_2(0),a:p„2(l),a;D(0),XD(l))'. 

From the assumption of the induction, this fiber is connected by the set of square- 
free degree 2 moves B C Z^'' . Then the set of square-free degree 2 moves 



z{ii ■ ■■ip-2), if ip-i = 
ip-i 



connects the fiber for M'y°. 
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• In the case of = 0, define the set 



y{ii . . . ip-30) = 
y{ii . . .ip-al) = 



Then we have 



x{ii ■■■ip. 


-3OO), 


if ii + ■ 


■ ■ ip-3 = 


(mod 2) 


x{ii ■ ■ ■ ip_ 


-3OI), 


if ii + ■ 


■ ■ ip-3 = 


1 (mod 2) 


x{ii ■■■ip. 


-3II), 


if *i + ■ 


■ ■ ip-3 = 


(mod 2) 


x{ii ■ ■■ip- 


-3IO), 


if ii + • 


• ■ ip-3 = 


1 (mod 2) 


■^m{im, 


, m = 


l,...,p 


2, ijfi 


= 0,1 



for X e T{MY) 
(12) 



yp-2{im) = Xp-2{im), = 0, 1 

yoiim) = Xp-i{im), im = 0, 1. 

Therefore we can also show that the set of square- free degree 2 moves connects the 
fiber for M'y" similarly from the assumption of induction. 

From the above consideration, we can restrict our attention to the cases that all 
the elements of M'y° are positive. Suppose J^{M'y°) is such a fiber and return to the 
consideration of z = y — x for x G J-*!, y G ^-"2, where J^i and J-2 are two-different B*- 
equivalence classes of J^{M'y°). We have to consider the cases that p (> 5) is even or 
odd, respectively. 

Suppose p is even. Because we suppose |z| is positive, we can suppose z{0 • • • 0) > 
without loss of generality. Because all the elements of M'y° are positive, there are at least 
one more positive entries in z. If z{l ■ ■ ■ 1) > 0, we can obtain z{ii ■ ■ ■ ip-i) < for some 
(ii • ■ ■ ip-i) G Xq. Then we can subtract a move 

zi = (0 • • • 0) (1 • • • 1) - (ii • • • v_i) (zt • • • z;_i) G B* 

from y without causing negative entries and we have |(y — Zi) — x| < |y — x|, which 
is contradiction. Then we can suppose z{0 • • • 0) < and we can choose other positive 
entries in z. Without loss of generality, we can suppose 

^(^_^1_^) > 0, 

p—l—r r 

where r is an odd and p— 1 — r>2isan even. Because the one-dimensional totals 
y°(0), . . . , yp_i_r{0) are positive, z has another positive element. Suppose z has a positive 
element expressed as 

z{ *l * 1* * ' ^' > 0- 

p—l—r r 

Rewrite this as 

p—3—r r 

without loss of generality. Then we can subtract a move 

Z2 = (0_^00 1_^)(*_^11*_^) - (^_^11 1_^)(*_^00*_^) G B* 

p—3—r r p—3—r r p—3—r r p—3—r r 
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from y without causing negative entries. Note that |(y — Z2) — x| is not necessarily smaller 
than |y — x|. However, (y — Z2) — x = z — Z2(= z*) has positive entries 

z*{0 ■ ■ ■ 0) > 0, ^*(0_^1_^) > 0. 

p— 3— r r 

Therefore repeating this procedure, we can reduce the pattern of the positive part of z as 
(0 ■ ■ • 0) (p ■ - Q 1 - -1) (10 -J ■ (010^ ■ ■ ■ ^ ■ >k) ■ ■ ■ (0_^;^*j^)- 

p—l—r r p—l—r r p—l—r r p—l—r r 

Now consider the cells 

(10^^*_^), (01CI^__0*_^), • • ■ , (0_^;01 *_^). 

p—l—r r p—l—r r p—l—r r 

If p — 1 — r > 2, there are two cells in these p — l — r cells where both of them have the 
same level in the same axis in the latter r axes, such as 

p—l—r r p—l—r r 

or 

(10^^1^_^j_^), (010^ - - 01 * - ■*)■ 

p—l—r r p—l—r r 

In the former case, three-dimensional marginal table of z"*" with respect to ii,i2,ip-r has 
positive elements 

(000)(001)(100)(010), 

which contradicts Lemma [H Similarly, in the latter case, three-dimensional marginal 
table of z+ with respect to zi, Z2, ip-r has positive elements 

(000)(001)(101)(011), 

which also contradicts Lemma [H Therefore we only have to consider the case that p — 
1 — r = 2 and the positive entries of z is written as 

(000 ■ ■ ■ 0)(001 ■ ■ ■ l)(10^3 ■ ■ ■ ip-i)i01il ■ ■ ■ i;„i). 

In this case, we can subtract a move 

(10z3 ■ ■ ■ ip-i)iOli; ■ ■ ■ - (100 ■ ■ ■ 0)(011 ■■■l)eB* 

from y without causing negative entries and we have positive elements 

(000 ■ ■ ■ 0)(001 ■ ■ ■ 1)(100 ■ ■ ■ 0)(011 ■ ■ ■ 1). 

Here, considering collapsing table with respect to i^, . . . ,ip_i axes such as (0 ■ ■ ■ 0) — )■ 
and (1 ■ ■ ■ 1) — )■ 1, we again have the p = 4 case and can derive a contradiction from 
Lemma [H Therefore we have proved for the case that p is even. 

On the other hand, for the case that p is even, we focus on (01 ■■■1) instead of 
(1 ■ ■ ■ 1) and can follow the arguments above in similar way. Then we complete the proof 
of Theorem □ 
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3.3 Structure of minimal Markov basis 

From the proof of Theorem [H the structure of minimal Markov basis is revealed for the 
main effect models of 2p~^ fractional factorial designs of resolution p. We call a fiber 
J-'(M'y°) as degree 2 fiber if Xligx = 2. As a consequence of Theorem [H we see that 
the degree 2 fiber plays an important role to construct a Markov basis. 

Corollary 1. A minimal Markov basis for M' of the main effect models of2P~^ fractional 
factorial design of resolution p is constructed as the set of moves connecting all degree 2 
fibers with more than one elements. 

From this corollary, we can characterize the structure of minimal Markov basis consid- 
ering all degree 2 fibers with more than one elements. We call such a fiber as an essential 
fiber below. We show some first steps as follows. Recall that the sufficient statistics is 
given as f lTOj) . In the following, deleting some commas from ffTOj) . we write the sufficient 
statistics as 

M'x = (a;i(0)xi(l), . . . , Xp_i(0)a;p_i(l), Xz5(0)a;i5(l))'. 

• Case of p = 4. Only essential fiber is given as 

^(11,11,11,11) = {(111)(222), (112)(221), (121)(212), (122)(211)}. 
To connect this 4-elements fiber, 3 moves such as 

{(111)(222) - (112)(221), (111)(222) - (121)(212), (111)(222) - (122)(211)} 
constitute a minimal Markov basis. 

• Case of p = 5. There are 10 essential 4-elements fibers such as 

J^(20, 11, 11, 11, 11), T{02, 11, 11, 11, 11), . . . , J^(ll, 11, 11, 11, 20), 11, 11, 11, 02). 

To connect these 4-elements fibers, 3 moves for each fiber are needed. Therefore the 
minimal Markov basis is constructed as 3 x 10 = 30 moves. 

• Case of p = 6. There are 60 essential 4-elements fibers such as 

J^(20, 20, 11, 11, 11, 11), . . . , J^(ll, 11, 11, 11, 02, 02) 

and one essential 16-elements fiber 

J^(ll,ll,ll,ll,ll,ll). 

To connect these fibers, 3 moves for each 4-elements fiber and 15 moves for the 
16-elements fiber are needed. Therefore the minimal Markov basis is constructed as 
3 X 60 + 15 = 195 moves. 
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• Case oi p = 7. There are 280 essential 4-elements fibers such as 

^■(20, 20, 20, 11, 11, 11, 11), . . . , ^^(ll, 11, 11, 11, 02, 02, 02) 
and 14 essential 16-elements fiber such as 

^■(20, 11, 11, 11, 11, 11, 11), . . . , ^^(ll, 11, 11, 11, 11, 11, 02). 
The minimal Markov basis is constructed as 3 x 280 + 15 x 14 = 1050 moves. 

• Case of p = 8. There are 1120 essential 4-elements fibers such as 

^■(20, 20, 20, 20, 11, 11, 11, 11), . . . , T{n, 11, 11, 11, 02, 02, 02, 02), 
112 essential 16-elements fiber such as 

J'(20, 20, 11, 11, 11, 11, 11, 11), . . . , T{n, 11, 11, 11, 11, 11, 02, 02) 
and one essential 64-elements fiber 

^•(11, 11, 11, 11, 11, 11, 11, 11). 
The minimal Markov basis is constructed as 3 x 1120 -|- 15 x 112 -|- 63 = 5103 moves. 
We summarize the number of essential fibers as follows. 

Proposition 1. A minimal Markov basis for M' of the main effect models of 2''^^ frac- 
tional factorial design of resolution p is constructed as 

I I < moves connecting 4*~^ { , ^ x I essential 4*~*-elements fibers > 

,yi V2(i-i); J 

for p — 2s, and 

I I < moves connecting 2^*"^ 1^1 essential 4*~* -elements fibers > 
for p = 2s -|- 1. 
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